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ABSTRACT 

The red sequence is an important feature of galaxy clusters and plays a crucial 
role in optical cluster detection. Measurement of the slope and scatter of the red 
sequence are affected both by selection of red sequence galaxies and measurement 
errors. In this paper, we describe a new error corrected Gaussian Mixture Model 
for red sequence galaxy identification. Using this technique, we can remove the 
effects of measurement error and extract unbiased information about the intrin- 
sic properties of the red sequence. We use this method to select red sequence 
galaxies in each of the 13,823 clusters in the maxBCG catalog, and measure the 
red sequence ridgeline location and scatter of each. These measurements provide 
precise constraints on the variation of the average red galaxy populations in the 
observed frame with redshift. We find that the scatter of the red sequence ridge- 
line increase mildly with redshift, and that the slope decreases with redshift. We 
also observe that the slope does not strongly depend on cluster richness. Using 
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similar methods, we show that this behavior is mirrored in a spectroscopic sample 
of field galaxies, further emphasizing that ridgeline properties are independent of 
environment. These precise measurements serve as an important observational 
check on simulations and mock galaxy catalogs. The observed trends in the 
slope and scatter of the red sequence ridgeline with redshift are clues to possi- 
ble intrinsic evolution of the cluster red-sequence itself. Most importantly, the 
methods presented in this work lay the groundwork for further improvements in 
optically-based cluster cosmology. 

Subject headings: Galaxies: clusters - Cosmology: observations - Methods: Data 
analysis, Gaussian Mixture, Bootstrap 



1. Introduction 



Galaxy clusters are the largest gravitationally bound systems in our Universe, whose 
masses, abundance and spatial dist ribution reflect the growth of structure, c omposition, and 
expansion history of the Universe (lEvrardl Il989l ; lOukbir fc Blanchardl Il992l ) . The feasibility 
of constraining cosmological parameters using galaxy clusters has be e n dem onstrated by 
many authors iMaiumdar fc Mohrl 120041 : Ell 120031 : iLima fc Hul I2004I . l2005h and realistic 
constraints on cosmologic al parameters from o ptica lly selected galaxy clusters have been 
impleme nted recently by IGladders et al.l (120071 ) and iRozo et al. d20Q9h on the RCS cluster 
catalog ( Gladders &: Yee 20051 ) and maxBCG catalog ( Koester et al. 2007a bl) respectively. 



The predominantly 



1.4 (IBower et al. 



Blakeslee et al 



20031 : 



red, bright, passively evolving red sequence, or "E/S0 ridgeline" (jVisvanathan fc Sa 



19771 : lAnnis et al.l Il999) fou n d in the cores of clusters of varied richness up to at least 



1992: 



Smail et al. 



Mullis et al 



2005: 



199 



van Dokkum et al. 



Eisenhardt et al 



20051 : 



199 



Barrientos 1999 



De Lucia et al 



20071 ) provide 



an efficient means for cluster detection, and have been an integral part of modern cluster 



cosmology. The red sequence itself is ubiquitous in the galaxy population (IRenzinil 12006 
e.g.), and i n clusters red sequence galaxies dominat e the bright end of the cluster luminosity 



function (ISandage et al.l 11985c iBarger et al.lll998l ). They are extremely tightly clustered 
in color space, co ntaining old populatio ns of stars whose observed color varies smoothly 
with redshift (e.g. IGladders fc Yedl2000l ). The pervasiveness of this phenomenon in clusters 
enables efficient optical cluste r detection while a fortuitous color-redshift relation yields ac- 
curate photometric redshifts ( Gladders fc Yee 2000 : Koester et al. 2007b}). Simple counting 
of photometrically identified cluster red sequence galaxie s (IKoester et al.l l2007al. e.g) has 



also been shown to be an e ffective proxy for cluster mass (IBecker et al.l 120071 : ISheldon et al. 



20071 : 1 Johnston et al.l 120071 ) . with more sophisticated applications yielding improvements in 
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richness as a cluster mass proxy (IRozo et al.ll2008l ). In the era of precision measurements, 
the extent to which the red sequence can be exploited for cluster cosmology depends on how 
accurately its characteristics can be measured at a given redshift. 

In addition to its relevance to cluster cosmology, the red sequence plays an important role 
in constraining the complex physical processes that drive galaxy formation and e volution. At 



the field scale this inclu des measurements of the red galaxy luminosity function (W ake et al. 



2006 



2005 



Faber et a 



Coil et al 



2007), the clustering of red galaxies in various environments (IZehavi et al. 
2008| ), and color-m agnitude relations of spectroscop i cally and morpholog i 



cally identified early- type galaxies (jCool et al.ll2006l ; iMei et al.ll2006l : IStanford et al.lll998l ). 
The high density environments of clusters of galaxies are dominated by red sequence galax- 
ies; the red sequence portion of the galaxy populations in the cores of rich clusters to at 
least z ~ 1 form the basis for various monolithic collapse scenarios (e.g. Bower et al. 1992 



Blakeslee et al. 


2003: 


Mei et al. 


2009 


)• 


Faber et al. 


(2007) 



to fill out a picture of galaxy formation that includes a mechanism for the formation of the 
red sequence. 

In color-magnitude space, the red sequence is typically characterized by its slope, zero 
point, and scatter. Many theoretical modeling have been prop osed to explain the red se- 
quence (e.g. lArimoto Yoshiill 19871 ; iKauffmann k, Charlotlll998l ). Various models posit that 



in the rest frame, the scatter in the red sequence is driven primarily by age effects, its slope 
is a manifestation of the mass-metallicity relati on, and the zero point is set by combina- 
tion of age and m ass-metallicity differences 
Faber et al.ll2007h . 



e.g. 



Bernardi et al.ll2005l : iDe Lucia et al.ll2007l : 



Studies of the cluster red sequence hav e been accomplished by simply measuring the 



photometric colo r-magnitude relation (e.g. lLopez-Cruz et al.l 12004 ; IDe Lucia et al. 



Stott et al.ll2009l ). supplemented with HST morphological i nformation (e.g 



2007 



Gladders et al. 



1998h and spectroscopy at higher redshift jMei et al-lboOfl Istott et al.ll2009h . Extra 



mor- 



phological and spectroscopic data allow precise separation of E and SO-types from the rest of 
the galaxy population , as well as refined identification of cluster members (jBlakeslee et al. 



20031 ; IMei et al.ll2009l ). The situation also benefi ts significantly from prec ise color measure- 



ments afforded by deep, CCD-based imaging (e.g lvan Dokkum et al.lll998l ). In the literature, 
the red sequence has been measured with various levels of scrutiny in dozens of individual 
clusters. 

In the past several years, researchers have turned to the considerable resources of the 
the Sloan Digital Sky Survey (SDSS) and simi lar wide field surveys to probe red sequence 



and elliptical gal axies in various environments (IHogg et al.ll2004t iBernardi et al.ll2005l . 12006 



Cool et al.ll2006l ). These studies include both spectroscopically and morphologically iden- 
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tified red galaxies at z ~ 0.1 that aim to constrain galaxy evolution scenarios for the 
cosmoloKically- r elevant luminous red galaxy (LRG) samples that extend to z ~ 0.6 (e.g. 



Cool et all 120061 ) . 



With the maxBCG cluster catalog, we are positioned to use the SDSS to make one of 
the most statistically robust photometric measurements of the cluster red sequence, using 
nearly 14,000 clusters between 0.1 < z < 0.3. In this paper we focus on the slope and 
scatter of the red sequence. We clearly show the systematic effects photometric errors have 
on the measurement of the underlying slope and scatter of the red sequence, and introduce a 
method for properly handling these effects. This method, based on an error-corrected Gaus- 
sian Mixture Model (ECGMM), reliably recovers the properties of the ridgeline by taking 
measurement errors into account. After presenting the method, we describe its application 
to measurement of maxBCG clusters. Of particular relevance to cluster cosmology are the 
observed mean, scatter, and slope of the E/S0 ridgeline for all maxBCG clusters. These 
results are presented, along with a discussion of observed trends with redshift. 



2. Methods 

2.1. Intrinsic properties of red sequence ridgeline 

The existence of the red sequence ridgeline is evidence that cluster galaxies are clustered 
in color space in addition to real space. The emission from early-type galaxies is dominated 
by old stellar populations, which gives rise to these remarkably similar galaxy colors. In 
addition, there is a close mapping between galaxy color and redshift for these galaxies as 
a result of the restframe 4000 A break in their spectra. For the SDSS filter sets, the 
4000 A break is within the g band as long as the redshift is below 0.35. Therefore, the most 
informative ridgeline color for the maxBCG catalog is g — r. 

In the projected vicinity of a detected cluster, there are both cluster member galaxies 
and field galaxies. Red sequence galaxies form a part of the member population, whose 
colors are clustered tightly and can be approximated with a Gaussian distribution with 
narrow width. On the other hand, the field galaxies' and blue member galaxies' colors are 
not tightly clustered and can be approximated by a Gaussian distribution with a broader 
width_|. The problem of separating the ridgeline from the field can be specified as following: 



1 There are complicated situations where the distribution in color space is not simply unimodal or bimodal, 
for example when two clusters are seen in projection. For maxBCG catalog, it covers about 7,500 square 
degrees with about 14,000 clusters. This leads to about 2 clusters per square degree. Each cluster is about 
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What are the two Gaussian components (one for the ridgeline and one for the field) that 
represent the color distribution in the vicinity of a galaxy cluster? If this double Gaussian 
is an adequate model for describing this color distribution, the one dimensional Gausian 
Mixture Model (GMM) is well suited to the problem. 



In the traditional applications of GMM (iTitterington et al.lll985l ). measurement errors 



are not considered. In our case, there are non-negligible measurement errors associated 
with the galaxy colors. We are interested in measuring the intrinsic color scatter of clus- 
ter members, absent contamination by the increasing measurement errors of faint galaxies. 
Without accounting for the increasing photometric errors, we expect that the color scatter 
will increase as the measurement errors become larger. While the intrinsic color scatter may 
increase as redshift increases (because the 4000 A line break is shifting toward r band and 
making the g — r color less discriminative.), measurement errors may make us overestimate 
the increase in intrinsic scatter with redshift. To avoid this problem, we include measure- 
ment error into our likelihood function to remove the contamination. We will refer to this 
as the error-corrected Gaussian Mixture Model and derive the corresponding Expectation 
Maximization (EM) recursive relation in the following section. 

It is clear that we can always improve the fit by adding more Gaussian components, al- 
though this is clearly not good in the sense of parsimony. So, we need to somehow decide on 
the number of Gaussian components by trading off quality of fit against the number of intro- 



duced free parameters. To do so, we use the Bayesian Information Criterion (BIC) (jSchwarz 



19781 ; IConnolly et al.l 120001 ) to determine how many mixtures we should use. The BIC is 



defined as: 



BIC = -2\ogC max + k\og(M) (1) 

Where k is the number of free parameters. For mixture models with different number of 
components, we compare their corresponding BIC and select the model with the smallest 
BIC. 



2.2. Error-corrected Gaussian Mixture Model 

In what follows, we describe how to fit a multi-component Gaussian mixture model to a 
one dimensional distribution of data with both intrinsic scatter and Gaussian measurement 



the size of a few arcminutes across, so the chance of two or more overlapped clusters is low. Therefore, a 
unimodal or bimodal distribution in color space is a good approximation. 
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errors. Our method is an ext ension of the traditional expectation maximization method for 
GMM rtPempster et~aDll977h . 



We assume the data are to be modeled by a mixture of N Gaussians fit to the distribution 
of M data points. The subscript % cycles through N and j cycles through M, and we use 
Hi, Oi and W{ to denote the location, width and weight of each Gaussian component, yj and 
Sj denote the data points and their measurement errors which are assumed to be Gaussian. 
For brevity, we denote the parameters (/ij, cr, and collectively by 9. Then the likelihood 
of the parameters given the data and measurement errors is: 



M 



N 



Wi 



: exp 



(2) 



The optimal parameters 9 could be estimated by maximizing the above likelihood func- 
tion. The Expectation Maximization algorithm provides an efficient way to get the maximum 
likelihood estimators in such a setting. To utilize this, we need to introduce a hidden vari- 
able, Zj, which tells which Gaussian component the data point yj is sampled from. In our 
case, different from the standard EM prescription, we have non-negligible measurement er- 
rors present. After some algebra, we arrive at a set of recursive relations that lead to the 
maximum of the likelihood (see appendix for details). 



2.3. Bootstrapping to increase the robustness of ECGMM 

Though the ECGMM is generally stable for estimating the parameters of the Gaussian 
Components, it can fail occasionally due to very inappropriate choice of initial parameters or 
some very big measurement errors of certain galaxies. To make our measurement more ro- 
bust, we introduced a bootstrap-like scheme. Suppose we have M data points. We randomly 
pick one of the data points and record it. We then repeat this process M times and get M 
recorded data points. These M points form one resampling set of the original data set. Now, 
we apply the ECGMM to this new data sample and measure the corresponding parameters. 
After this, we start a second round, getting another resampling set with M data points in it 
and measure the parameters using ECGMM again. We repeat this process X times and have 
X estimates of each parameters. We throw away those outlier estimates (estimates beyond 
the upper and lower inner fences^) for each parameter and use the mean of good estimates 



2 In statistics, lower inner fence is defined by Q\ — 1.5IQR and higher inner fence is Q3 + 1.5IQR, where 
Qi and Q3 are the first and third quartiles respectively. The IQR is the the interquartile range, defined as 
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as the value of each parameter. Using this scheme, our resulting parameter estimates are 
much more robust, at a cost of a tolerable increase in computation time. In this application, 
we took X to be 50. 



2.4. Monte Carlo test of the ECGMM for our application 

Before we delve into real data, we first conduct Monte Carlo tests to see whether the 
ECGMM approach can reliably identify the cluster and background Gaussian components. 
These tests are used to determine whether this method can reliably recover the true pa- 
rameters input in the simulation, and to see whether the extracted parameters are generally 
unbiased with respect to measurement errors. 

For this purpose, we generate two Gaussian random data sets, one representing cluster 
member colors, denoted CL, and the other representing the field galaxies/blue galaxies' 
colors, denoted as BG. The CL set is generated from ~ iV(0.5, 0.04 2 ) and BG set is generated 
from ~ iV(0,0.3 2 ). To represent clusters with different richness, we allow the normalization 
(also denoted as N ga i s in the plots) of CL data set to vary as 10, 15, 20, 25, 30, 40, 50, 60 
and 70 while keep the normalization of BG set as 30. All the parameters used to generate 
the mock data are chosen to make the simulation as close to the real data as possible. Then 
we combine CL and BG to create a mock data set that mimics the colors of both cluster 
members and background galaxies in a field. It is worth noting that these mock colors are 
error free so far. Next, we will add some noise to them to mimic the measurements errors. 
To do this, we first generate random numbers from a uniform distribution in the range of 
[0, 0.1], which play the role of 8j in Eqf2J Then, we generate from N(0, S"j) and add them to 
the noise free data set to produce a noise added mock color data set. In FigJH we plots the 
results from the ECGMM fitting. The results show that for clusters with N ga i s > 10, the 
method can recover the locations (n) and widths (a) of the Gaussian components very well. 

Next we test for possible bias in the estimators. For each cluster richness N ga i s , we re- 
generate the data as well as errors 200 times and then apply our methods to each to obtain 
estimates for the parameters. In each case, we calculate the bias of parameters 9 (the a and fi 
in our case) defined as E(§)-8. In FigfJ, we plot the results from both GMM and ECGMM 
for comparison. Clearly, the introduction of error correction(as shown in the bottom two 
panels) is essential for removal of the bias of the width resulting from measurement error(as 
shown in the top two panels). 



Qs - Qi. 
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M cl = 0.499 
Mbg =0.012 
<7 cl =0.037 
a h =0.267 



Ngals= 1 5 
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(i h =0.280 



Ngals = 20 
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Ngals = 40 
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// b =0.036 
<r =0.030 
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Mock Color 



Fig. 1. — Basic Monte Carlo tests of the ECGMM method for fitting mock galaxy g — r 
color distributions (see text), /i and a denote the locations and widths of the corresponding 
Gaussian components, for clusters of increasing richness. The true /i are and 0.5 for BG 
and CL sets respectively. The true a are 0.3 and 0.04 for BG and CL sets respectively. 
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Fig. 2. — Monte Carlo test of the bias, (E(6) — 6), of the estimators for the location and 
width using GMM (bottom two panels) and ECGMM (top two panels) as a function of 
richness for the cluster component of the mock clusters (see text). The scales of the plots 
are chosen to be close to the size of the true parameters in the plot to illustrate the fraction 
precision. 

3. Data 
3.1. SDSS 



Three main resources are included in this work: the SDSS galaxy catalog, the maxBCG 
catalog, and a value-added SDSS spectroscopic catalog. 

The maxBCG cluster sample and the galaxy catalogs used to remeasur e cluster richness 
in this paper are derived from the SDSS ( lAdelman-McCarthy et al.ll2006l ). Th e maxBCG 
clust er sample covers a sky area of about 7500 square degrees. The camera design (IGunn et al. 
20061 ) and drift-scan imaging strategy of the SPSS enable acquisit ion of nearly si multaneous 



observations in the u, q, r,i,z filter system (IFukugita et a. 



200 



Smith et al. 120021 ; iTucker et al.l 120061 ) , astrometric (IPier et al.l 120031 ) , and photomet- 



1996). C alibra tion (IHogg et al.l 



ric (ILupton et al.l 120011 ) pipelines reduce the data into object catalogs containing a host of 
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measured pa rameters for each ob ject. Galaxies are selected from SDSS object catalogs as 
described in (jSheldon et al.l 120071 ). In this work we use CM0DEL_C0UNTS as our total magni- 
tudes, and M0DEL_C0UNTS when computing colors . Bright stars, survey edges and regions o f 
poor seeing are masked as previously described (IKoester et al.ll2007at ISheldon et al.l 120071 ). 

The spectroscopic galaxy catalog is comprised of galaxies from the DR6 of SPSS Value 



Added Galaxy Catalog. A detailed description about this catalog can be found in lBlanton et al. 
(hoo.4 VAGC). 



3.2. Cluster Sample 



We obtain sky locations, redshift estimates, and initial richness values from the maxBCG 
cluster cat alog. Details of the selection algorithm and catalog properties are published 
elsewhere ( Koester et al. 2007bl Jah . In brief, maxBCG selection relies on the observation 
that the galaxy population of rich clusters is dominated by luminous, red galaxies clustered 
tightly in color (the E/S0 ridgeline). Since these galaxies have old, passively evolving stellar 
populations, their g — r color closely reflects their redshift. The brightest such red galaxy, 
typically located at the peak of the galaxy density, defines the cluster center. 

The maxBCG catalog is approximately volume limited in the redshift range 0.1 < z < 
0.3, with very accurate photometric redshifts (5z ~ 0.01). Studies of the maxBCG algorithm 
appli ed to mock SPSS cata logs indicate that the completeness and purity are very high, above 
90% (IKoester et al.ll2007al ). The maxBCG catal og has been used to investigate the scaling of 
galaxy velocity dispersion with cluster richness (IBecker et al.ll2007l ) and to derive constrai nts 
on the power spectrum normalization, dg, from cluster number counts (IRozo et al.l 120091 ) . 



4. Measuring the ridgeline location and width of maxBCG clusters 



We apply t he above prescription s of ECGMM to the maxBCG cluster catalog and the 
galaxy catalog (IKoester et al.ll2007a ). measuring the red sequence g — r ridgeline. The pro- 
cedures are as follows: for each cluster in maxBCG catalog, we choose a scaled aperture 
i?2oo s to ensure we are considering equivalent regions of clusters of varied masses and there- 
fore varied richness, -R^oo* i s the critical radius, interior to which the mean mass density 
of th e cluster is 200 times of the critical ener gy density. Based on the weak lensing anal- 
ysis (IJohnston et al.l 120071 : lHansen et al.l 120071 ) , the scaling relation between i^ocf an d the 

^200 ) 

Mpc to 1.68 Mpc. 



original maxBCG richness A^oo is given by R l ^ s = 0.182(A r 2 oo) ' 42 ) which ranges from 0.47 
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Next, we identify all SDSS galaxies inside this aperture range, fainter than the BCG, 
and brighter than an i ba nd magnitude corresponding to 0.4 L* at the redshift of the clus- 



ter (IKoester et al.l l2007al ) . Then, we apply the ECGMM procedure to the g — r colors and 
corresponding measurement errors of these galaxies. One of the resulting two Gaussian com- 
ponents from the ECGMM will represent the cluster red sequence color distribution while the 
other represents the background/blue galaxy color distribution. To determine which Gaus- 
sian Component belongs to the cluster, we calculate the likelihood of the BCG's g — r color 
on each Gaussian Component. The component for which the BCG has a higher likelihood 
is assigned as the cluster component and the other is declared background. By this way, 
each maxBCG cluster gets a new richness, N!§^ s . It is worth noting that we apply the above 
measurements to all maxBCG clusters whose original iV200 > 10. But we will only continue 
our analysis on a subsample of the clusters whose new measured richness A^qo* — 10 and 
have two identified Gaussian mixture components in order to guarantee the reliability of our 
measurements. After this selection, we are left with about 7100 clusters and all our further 
analysis are based on them. We need to point out that the clusters falling outside of this 
selection are not necessarily bad clusters. They are just fall below the richness threshold we 
imposed for quality control. In FigJ3]and FigJU we show the ECGMM fitting of 9 big and 9 
small clusters as described above. Their corresponding CMRs are plotted in Fig 15] and FigJHJ 

For comparison, we measure the red-sequence location and width using both ordinary 
GMM and ECGMM. The top panel of Fig. [7] shows the evolution of the average g-r ridgeline 
location and width measured using ordinary GMM. We observe the well-known trend in the 
average ridgeline zeropoint, and there is additional apparent strong evolution in the average 
ridgeline width, which becomes nearly 140% larger by z = 0.3. However, from the lower two 
panels which are measured using ECGMM, one can see very clearly the power of ECGMM 
in constraining the intrinsic width of the ridgeline without contamination from measurement 
error. The results show that the mean observed g—r ridgeline location retains the same linear 
dependence on redshift while the mean scatter of the ridgeline shows a weak dependence on 
redshift, with the g - r scatter a{z = 0.1) = 0.051 ± 0.003 and a(z = 0.3) = 0.079 ± 0.005 
or a broadening by ~ 55% from z — 0.1 to z — 0.3. The strong dependence of the scatter 
on redshift from the GMM is mostly due to the increased measurement errors for cluster 
members at higher redshift. 
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g — r 

Fig. 3. — The ECGMM fitting to the galaxy color distribution around 9 rich clusters. Note 
the fact that we corrected for measurement errors, the two Gaussians appear to be narrower 
than the histogram. 
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Fig. 4. — The ECGMM fitting to the galaxy color distribution around 9 small clusters. 
Note that when there are fewer galaxies histogram is no longer a good way to show the 
distribution. 
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Fig. 5. — The CMR around the 9 rich clusters in Figj3j The red diamonds and error bars are 
those from the selected members (red sequence) and the blue dots and error bars are those 
from field galaxies. The red cross symbol represent the BCG of that cluster. All the galaxies 
are within -R^oT aroun d the BCG, fainter than BCG but brighter than the 0.4L*(see text). 
The green line is a weighted least square fit (weighted by the inverse square of color errors) 
to the cluster galaxies. 
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Fig. 6. — The CMR around the 9 small clusters in FigJH 
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Fig. 7. — Tracking the (g — r) red sequence zeropoint and width as a function of redshift, 
measured using ordinary GMM (upper panels) and ECGMM (lower panels) respectively. We 
bin the measured ridgeline color and width into redshift bin of 0.04 and then fit the means 
with a straight line. After error correction, the broadening of the observed red-sequence 
width with redshift is greatly suppressed, revealing the effect of photometric errors on the 
observed broadening. 



-17- 



5. The red sequence ridgeline slope 
5.1. Ridgeline slope from galaxy clusters 

It has been pointed out that the color-magnitude relat i on (CMR) of cluster member 



galaxies has a negative slope (e.g lKodama &: Arimotolll997l ; iGladders et al.lll998l ). so that 



fainter member galaxes are generally bluer. The evolution of these CMR slopes with respect 
to redshift and richness has been difficult to address, largely due to the lack of a sufficiently 
large cluster catalog with well measured photometry for all its galaxies. The maxBCG 
catalog provides about 14,000 galaxy clusters, extending over 0.1 < z < 0.3, which enables 
us to measure the slope of the CMR for clusters with good statistics across a range in both 
richness and redshift. 

Measurement of the slope of the CMR typically proceeds by identification of the cluster 
red-sequence, followed by some iterative process of outlier removal, and a determination of 
cluster "member" galaxies which are then used to measure the slope and zeropoint of the 
cmrE We apply the method described in previous sections to measure the color distribution 
of individual clusters and to assign the memberships for every cluster by requiring the color 
difference between the member galaxies and ridgeline within ±2<r (cr is the convolved ridgeline 
width, given by the best-fit ECGMM, and the measurement errors of individual member 
galaxy's color). The richness of the cluster measured by this way is denoted as N^ s . We 
choose 2cr because this is roughly where the background component's likelihood dominates 
over the cluster component's likelihood. Based on this identification of membership driven 
by the ECGMM, we fit for the CMR of clusters galaxies with a straight line using weighted 
least square fitting. The weights we used are the inverse square of the measurement errors 
of g — r. We call the slope of the fitted line as the slope fo the ridgeline. 

The distribution of ridgeline slopes for maxBCG clusters is shown in FigJH] in bins of 
Az = 0.03. Despite the substantial scatter in slope among individual clusters, we can see 
from FigJH] and Figj9] that the mean slope of the red sequence ridgelines for clusters deviates 
from zero for 0.1 < z < 0.3. For any bin, the error on the mean places the measurement 
many standard deviations from zero. 

In FigJHl it is apparent that the observed trend of the mean ridgeline slope with redshift 
is statistically significant: the slope becomes steeper by a factor of 2.5 by z = 0.3. In FigfiOj 
we plot the evolution of the slopes vs richness in each redshift slice, which shows that the 
dependencies of the ridgeline slope with respect to richness is weak, as shown elsewhere (e.g. 



3 In lAndreonl (|2006f ). a slightly different method was introduced by directly modeling the CMR and 
measurement errors into the likelihood function without separating the red sequence galaxies first. 
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Fig. 8. — The distributions of measured ridgeline slopes for clusters in steps of 0.03 in redshift. 
/i and a denote the mean and width of the distribution. The dashed line corresponds to 
zero. 
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Fig. 9. — Tracking the observed red-sequence slope vs redshift. The gray clouds represent 
the slope measurements from individual clusters. The black solid circles and error bars 
are weighted mean and the standard deviation to the weighted mean for each redshift bin 
(A = 0.03). Note that the error bars in the plot are smaller than the symbols. 



Hogg et al.l 12004 ). Clearly, the observed slope of the red-sequence is not associated with 
cluster richness, and is unsurprisingly a strong function of redshift (see Discussion). 



5.2. Ridgeline slope from spectroscopic data 

The above measurement is based only on a photometric determination of red sequence 
galaxies. The level to which projection plays into this selection is as yet unknown. The true 
red-sequence galaxy population in some physical volume, either in a cluster or in the field, 
is contaminated by dusty foreground galaxies which can be rejected via spectroscopy, and 
by the peculiar velocities of the galaxies themselves. 
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To address the possibility of foreground contamination, it is interesting to see if the 
above results are preserved in a spectroscopic sample of galaxies. To achiev e this goal, we 



use g alaxies with spectra from DR6 of SDSS Value Added Galaxy Catalog (IBlanton et al. 



20051 . VAGC) for a comparison. Due to the selection effects of the spectroscopic data, we 
will choose only the galaxies in redshift from 0.1 to 0.2 and brighter than 0.4 L* magnitude 
at the ir respective redsh ifts. By extension from the photometric sample and from previous 



work (IHogg et al.l 120041 ). we know that the slope does not vary with environment, so the 
field sample represented by our spectroscopy should be a fair representation of the expected 
slope in clusters. 

Our procedures are as follows: we first bin the galaxies into bins of size Az = 0.003, 
which corresponds to velocity slices of 900km/sec. The color distribution of the galaxies in 
each bin shows clear bimodality (top panel of Fig. [TTi) . Then, we separate the red sequence 
galaxies in each bin using ECGMM. The red sequence galaxies correspond to the Gaussian 
component with bigger g — r value and we choose ±2cr from the peak location as red galaxy 
samples for each redshift slice, in a fashion similar to the one we used for cluster galaxies. 
Then, in every bin, we fit the CMR of galaxies' g — r colors and z-band magnitude with 
a straight line using weighted least square fitting with the weights come from the inverse 
sequare of the color measurement errors. The weights are the inverse square of the g — r 
measurement errors. We record the corresponding slopes. In the bottom panel FigJTTI we 
choose 6 redshift bins (Az = 0.003) to illustrate the red/blue galaxy separation and the 
ridgeline slope fitting in each bin. Finally, we fit the variation of slope with redshift with a 
line to look for a trend, the results are shown in the left panel of FigJT2"l As a comparison 
to the cluster sample, we also plot the mean variation of the ridgeline slope from clusters in 
the same redshift range [0.1, 0.2] in the right panel of FigJT2l When the redshift range is 
changed, the slope of the fitted line for the cluster sample becomes steeper as compared to 
FiglHJ The reason lies in that the linear fit to the trend is only the first order approximation. 
But for our purpose here, we just need to require the cluster sample and spectroscopic sample 
on the same redshift range so as to compare them fairly. 

Comparing the two panels in FigfT2l shows that the two slopes from spectroscopic sample 
and cluster sample are not different in a statistically significant way. This further confirms 
our previous observation that the cluster environment will not affect the slope. 



6. Discussion 



To this point, the measurements have been presented in the observed frame. Photo- 
metric cluster detection and the quantities derived (e.g. richness) operate in the frame of 
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the observer, and predictions from galaxy formation models and mock galaxy catalogs can 
be evaluated in light of these precision measurements. They have particular applicability to 
calibration of optical cluster detection efforts, especially to those that rely on the properties 
of the red sequence. The methodologies developed herein allow the "bootstrapping" of op- 
tical algorithms: basic cluster finders locate the clusters, and precision measurements (such 
as these) of said clusters lead to refinements in those algorithms. The extent to which each 
of these agrees with previous measurements from spectroscopy, other cluster samples, and 
simulations is left for future work. However, for illustrative purposes, we list the relevant 
observational considerations to be made in understanding the context of these measure- 
ments with respect to previous work in the literature, and then highlight a few of our more 
interesting results. 

In general, there are five places where the comparison to previous work must be treated 
with caution, which can be summarized as follows: 1) redshifting of the galaxy spectra 
through the bandpasses under consideration, which imp arts trends in the ob served colors, 
2) selection effects imposed by the color selection (e.g. iFranzetti et al.l 120071 ). 3 ) aperture 



effects, i.e. the aperture u sed to measure the color in different banspasses (e.g. IScodeggio 



200ll : iBlakeslee et al.ll2006l ). 4) projection effects. 5) actual evolution in the red sequence. 



To some level, any of the aforementioned issues may play into our results: (i) at z ~ 0.1, 
the CMR of photometrically-selected galaxies is noti ceably shallower than previous spe ctro- 
scopic measurements of the color magnitude relation ( Hogg et al. 2004 : Cool et al. 20061 ). (ii) 
the slopes are almost independent of cluster richness; (iii) the photometric error-corrected 
scatter of the red-sequence broadens mildly with redshift; (iv) the observed mean slope of 
the CMR is negative and it becomes more negative as redshift increases. 

Naively, we expect that our measurement of the slope of the red-sequenc e, —0.013 ± 



0.0003 mags mag 1 at z = 0.1, corresponds to the SDSS spectroscopic analysis of iHogg et al. 



(120041) . for whi c h the slope is -0.022 mags mag in (g — r) 



Hogg et al.l (120041 ) measurements are k-corrected to the z 



In addition to the fact that 
- 0.1 rest-fr ame, one possible 



the 

difference comes from our definition of the red sequence: IHogg et al.l (120041 ) use a 2cr clipping 
algorithm to define the red sequence and to iteratively reject outliers. While they split 
the sample by Sersic index, sigma-clipping may be more permissive of objects near the 
"blue cloud" to be included in the red-sequence, while the method presented in this paper 
automatically accounts for the presence of these objects. Our slope measurements at a given 
redshift may also be biased shallow, as the initial 2 a cut derived from the ECGMM fit does 
not account for the slope in the red-sequence itself, i.e. the cut is applied in the same way 
regardless of magnitude. Ideally, an iterative procedure would be employed to determine the 
best-fit line for each cluster and the 2a cut would be applied as a function of magnitude. 
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Unfortunately, the small number statistics for low richness clusters do not permit this to be 
implemented in a robust fashion. 

Insofar as richness and local density are similar indicators of environment, the second 
obser yation (ii) that the slope is almost independent of environment is in basic agreement 
with iHogg et al.l (12004 ). who use SDSS spectroscopy at z ~ 0.1 to compare galaxies with 
high (n > 2) Sersic indicies in different environments characterized by their local density. 

After the photometric error correction performed by ECGMM, a trend in the scat- 
ter with redshift remains (iii), such that the scatter increases with increasing redshift. At 
high redshift z__~ 1, the color-magnitude relation has been measured in a handful of clus- 



ters ( IMei et al.ll2009l ; iKoester et al.ll2009l ; ISantos et al.ll2009l . e.g.) with the general conclusion 



that the restframe scatter in the CMR does not evolve with redshift. More locally, the SDSS 
Luminous Red Galaxy (LRG) Sa mple has been us e d to measure vario us redshifted frames of 
bright (L > 2.2L*) red galaxies (jCool et al.ll2006l ). ICool et al.l (120061 ) find the intrinsic rest- 
frame scatter 0,16 (g — r) = 35.4 ± 3.7 and °' 37 (g — r) = 43.5 ± 6.2 mmags, consistent with no 
evolution. However, with increased cluster sample, our observed frame measurements reveal 
an increase in the scatter, shown by a statistically significant non-zero slope (the bottom 
right panel in Fig. [7]). 



Result (iv) is in qualitative agreement with the results in (IGladders et al.lll998l ) who 
find a similar trend in the slope for a sample 44 Abell clusters at z < 0.15 and 6 clusters at 
0.2 < z < 0.75, the largest previ ous study of its kind. In their study of the scatter of the 
CMR in LRGs, ICool et al.l (120061 ) report no significant trend with redshift in the rest-frame 
slope of LRGs over 0.16 < z < 0.37 in either the cluster or the field, but caution that the 
sample is not-well suited to measuring the slope. The observed factor of 2.5 increase in the 
magnitude in our measurement of t he slope is likely due to a combination of the lack of k- 
corrections and selection effects (e.g. iFranzetti et al.ll2007l ) derived from color cuts that may 
preferentially include a larger and larger fraction of galaxies with significant star-formation 
at increasing redshifts. 

A fu rther contribution t o the inflat ed slope may com e from the choice of the color 
aperture. Ivan Dokkum et al.l (119981 ) and IScodeggiol (120011 ) note the importance of the use 
of adaptive apertures, which place the color measurements of large and small galaxies on 
the same footing. This point motivates our choice of M0DEL_MAGS from the SDSS, which 
are derived from the best-fit convolution of the local PSF with a deVaucoleurs model in the 
r-band. This same best-fit model is then used to compute the flux in both the g and r-bands. 

As our results indicated, the intrinsic ridgeline slope may decrease as redshift increase. 
Though there are many factors that may complicate the interpretation of this results as we 
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discussed in the preceding paragraphs, it is still worth to speculate wh at may lead to th e 
intrinsic evolution. In general, galaxies are more active at higher redshift (jCowie et al.lll996l ). 
and their color distributon spreads wider and toward bluer end. Therefore, clusters at higher 
redshift are more likely to have member galaxies with wider and bluer color distribution. As 
a result, when we fit the CMR with a straightline, we tend to have more negative slopes at 
higher redshift. 



7. Summary 

In this paper, we have presented the ECGMM, a new purely photometric method which 
characterizes the red sequence ridgeline in cluster samples with large statistics. This provides 
precise measures of the mean variation of the red sequence ridgeline location and scatter 
(width) with respect to redshift, properly corrected for photometric errors. The measured 
slopes, scatters, and zeropoints are directly applicable to improved cluster finding efforts and 
to characterization of known galaxy clusters. 

Applying the method to maxBCG clusters approximately recovers known properties of 
the red sequence, namely its slope and the variation of the slope with redshift, and the in- 
sensitivity of the slope to environment. It also suggests that the scatter of the red-sequence 
increases mildly with redshift, and that the slope of the red-sequence grows substantially by 
z — 0.3, but we caution that these observed trends may be attributable to a host of obser- 
vational effects that we have made no attempt to correct. Color selection effects, the lack 
of k-corrections, and the details of the measurement of the individual cluster CMRs require 
proper attention before applying these results to models of galaxy formation. Nonetheless, 
these measurements can serve as an important observational check on simulation and mock 
galaxy catalogs. 
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A. The recursive relation for the error corrected Gaussian Mixture Model 

In this appendix, we show the derivation of the likelihood function Eq. [2] and the EM 
recursive relations for the error corrected GMM. To begin with, we introduce the following 
notations in TableO For brevity, we denote the parameters (fii, Oi and wi) collectively by 9 
and (t) represents the t th iteration. M represents number of data points and N represents 
the number of mixtures. 

Since we assume the true color distribution can be approximated by mixture of Gaussian 
distributions, we have the following probability density function for p(ijj\9): 



Though the true colors are not directly observable, we know that its distribution given the 
observed colors and measurement errors is approximately Gaussian: 




N 



(Al) 



Notations 



Meaning 



01, Ui, a n 



VI,..., Vj, y m . 
VI,—, Vj, —, Vm- 

Z\ , ... , Zj , . . . , Z m '. 




Observed colors of BCGs and member galaxies. 
True colors of BCGs and member galaxies. 

Hidden variables that tell which Gaussian component the yj is sampled from. 

Measurement errors for every yj . 

Mean of each Gaussian component. 

Width of each Gaussian component. 

Weights of corresponding Gaussian components. 



Table 1: The notations used in our derivation of ECGMM algorithm 
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(A2) 



then, the likelihood function (under the flat priors for 6) 

L (Q\yj) = I p(yj\o)p(yj\yj)dyj 



(A3) 



After integrating over yj and extending to all data points dX^), we arr i ve at Eqj2j The 
optimal parameters could be obtained by maximizing the above likelihood. However, if we 
introduce hidden variables, z, that tell us which Gaussian component the yj is sampled from, 
then the whole maximization process could be significantly simplified. The corresponding 
pdf of data given z and 9 is 



M 



M 



p{y\z 3 = = Hpiy^ = •,*<*>) = J] 
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: exp 
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2K (t )2 + 5]) 



(A4) 



We use u>j denote the weight of each Gaussian Component in the mixture and is given by 
Wi = p(zj = i\ff). The estimation of hidden variable could be related to Eq lA4l by Bayes' 
Theorem as following: 



P ( Zj = i, Vj \o {t) ) _ v{y d \z 3 = i,e®) P ( Zj = i\e®) 



(A5) 



p(y 3 \o {t) ) ZZMvjte = i,o®)v(*i = AW) 

The EM algorithm iteratively update the parameters 9 by maximizing the expected log 
likelihood 



N M 



i=l j=l ^ 1 3 ' 

(A6) 



under the constraint Y2i=iP( z j = = 1- Using the Lagrange Multiplier approach, we 

redefine 



N 



Q(6)=Q(0)-\ X>(*; = ^ (t) )-1 



(A7) 



i=i 
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with A as the multiplier. 



dQ{6) 



M 



fJLi 



From Eq JA8[ we can arrive at the following recursive relation for [i: 



,(*) 2 > 



Similarly, we have 



gg(g) 



M 
3=1 



+ ff/g?) - (yj - At) 



2^ 



(AS 



(A9) 



(A10) 



Note that since <7j and 5j are entangled within the summation, there would not be an simple 
analytic solution for c^. However, since the algorithm is iterative in nature and the major 
contribution for the update of <7j is from — /ij) 2 , we could approximate <7j in 5j/af with 
its value in t th iteration. Then we can solve for the (t + l) th iteration relation for as: 



0" 



?2 /„(*)2> 



1/2 



(AH) 



Our numerical test shows that such an approximation works fine in practice. For Wi = p(zj 
i\9), we have 



dQ(6) 

dwi 



M 



J2p(z j = i\y J ,e ( - t) )/w t -X = 

3=1 



(A12) 



which leads to 



1 M 



(A13) 



3=1 



Using the condition Y^Wj = 1, we have A = M. Substitute A back to Eq. IA13[ we arrive at: 
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«'* (m) = ^Ep(^=<ii/i.«i t) ) ( A14 ) 

3=1 

In the above iteration relations EqjA9j Eq JAllI and Eq JA14l t and t + 1 denote the 
round of iterations. When we ignore the measurement errors 5j, the above recursive relation 
reduces to the standard EM recursive relation for Gaussian Mixture Model. The above 
relations could be easily generalized to multiviate case by simply substituting the data with 
data matrix, mean with mean vector and variance with covariance matrix, which we will 
not repeat the formula here. A C++ class that implements the above algorithm is available 
upon request. 
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Fig. 10. — The evolution of mean ridgeline slope vs richness at different redshift slices. The 
richness bins brackets are chosen as N^ s =[10,20,30,40,60,80,161]. The light dark points 
are from individual clusters. The black solid dots and error bars are weighted mean and 
standard deviation of the slopes in each N ga i s bin for every redshift slice. From the plot, we 
did not see strong trends of the slope evolution w.r.t richness. 



-32 - 



N 
D 



O 



Mean z: 0.1 19460 
Ngals: 5117 
Weights: 0.58, 0.41 
Location: 0.80, 1.02 
Width: 0.18, 0.04 



Mean z: 0.151541 
Ngals: 5164 
Weights: 0.56, 0.43 
Location: 0.84, 1.05 
Width: 0.18, 0.04 



Mean z: 0.146488 
Ngals: 3898 
Weights: 0.55, 0.44 
Location: 0.89, 1.10 
Width: 0.18. 0.04 




Mean z: 0.161502 
Ngals: 3276 
Weights: 0.54, 0.45 
Location: 0.95, 1.16 
Width: 0.19, 0.04 



Mean z: 0.1 76509 
Ngals: 2467 
Weights: 0.52, 0.47 
Location: 0.99, 1.21 
Width: 0.20, 0.04 



Mean z: 0.191490 
Ngals: 1987 
Weights: 0.47, 0.52 
Location: 1.04, 1.27 
Width: 0.21, 0.04 




0.0 0.5 



1.5 0.0 0.5 1.0 1.5 0.0 0.5 



g-r 



Mean z: 0.1 19460 
Ngols: 51 1 7 

Slope: -0.01 66±0. 0002 


Mean z: 0.131541 
Ngals: 5164 

Slope: -0.01 58±0. 0002 


Mean z: 0.146488 
Ngals: 3898 

Slope: -0.01 74±0. 0004 

r «mmm~- 








li i i i i : 

Mean z: 0.161502 

Ngals: 3276 • - 

Slope: -0.0199±0.0005 


] I I I I ! 

Mean z: 0.176509 
Ngals: 2467 

Slope: -0.021 6±0. 0006 


:i i i i i ; 

Mean z: 0.191490 
Ngals: 1987 

Slope: -0.0237±0.0009 
-------- ; ^:,«MitiMM* 






msm?- — 



14 15 16 17 18 14 15 16 17 18 14 15 16 17 18 



— mag 

Fig. 11. — Evaluating the ECGMM-derived red sequence slopes in SDSS spectroscopy of 
field galaxies. The normalized color histograms (top panel) for Az = 0.003 slices in spectro- 
scopic redshift clearly show the presence of the red and blue components in the field galaxy 
distribution. ECGMM is used to separate the two components, the redder of which is to 
measure the CMR (bottom panel). 
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Fig. 12. — The comparison of the evolution of the slopes of CMR for spectroscopic sample 
and cluster sample. Since the spectroscopic sample is biased due to selection effects at 
z > 0.2, we choose both sample in the redshift range from 0.1 to 0.2 and then bin the 
slopes in redshift bin of 0.02. We then fit straight line to the mean slopes in each bin for 
spectroscopic sample and cluster sample respectively. 



